Using the Generalized Hessenberg Representation (GHR), the concept of aggregation is
extended to systems which nearly aggregate. Near aggregation is given a geometric
interpretation. Then near unobservability (defined as an invariant subspace near the null
space of C) is introduced and is shown to be equivalent to near aggregation if there exists
an appropriately dimensioned invariant subspace. These results depend on the introduction
of a topology into the state space, a novel feature of our approach. Finally, near
aggregation is shown to correspond to almost pole-zero cancellation for a certain class of
systems.


1.  INTRODUCTION 


Model reduction has long been a topic of interest, and it remains relevant to control
problems of today. For example, large flexible space structures are modeled by finite
element approximations of large order. Unfortunately, the computational constraints of
on-board digital controllers make complex control algorithms based on large models
infeasible. Thus, the control design requirements call for a simple model for closed-loop
control.

Aggregation was proposed by Aoki (1968) as one method of model reduction. This
concept was extended by the Generalized Hessenberg Representation (GHR) of Tse et al.
(1978). In this report, we investigate further properties of the GHR. In particular, we
discuss systems which nearly aggregate. This idea has intuitive appeal; however, it
turns out to depend on the scaling in the system. The usual algebraic formulation
of state space models does not directly incorporate scaling in the model. Since the scaling
issue appears directly in near aggregation, we incorporate it in the model by attaching an
inner product to the state space. This inner product leads to a topological structure in the
model which allows us to formalize near aggregation. Bart et al. (1980) used a similar
idea in the factorization of transfer matrices.

Near aggregation is related to the observability structure of the system through the
use of geometrical concepts (Wonham, 1979). To establish this connection, the concept of
near unobservability is introduced. Roughly, a system is nearly unobservable if there
exists an invariant subspace near the null space of C. It is shown that if an appropriately
dimensioned invariant subspace exists, then near aggregation and near unobservability are
equivalent concepts from a geometrical point of view. This generalizes the results of Aoki
(1968) and Tse et al. (1978).

It is well known that unobservable systems exhibit a pole-zero cancellation in the
transfer function. It is shown below that, under certain conditions, it is possible to
generalize this result. That is, under these conditions, nearly unobservable systems exhibit
an almost pole-zero cancellation. Hence, near aggregation corresponds, roughly, to
removing an almost pole-zero cancellation, a well-known procedure in classical control.

Aggregation has been related to several other model reduction methods (Lindner and
Perkins, 1984). The fact that the results here are used as a measure of observability
suggests that this approach may be related to balancing (Moore, 1981). However, there are
fundamental differences between the two approaches. First, balancing requires the
computation of the controllability and observability Gramians. Hence, this technique is
most useful for stable systems. Near unobservability is determined from the algebraic and
topological properties of the vector space and operators associated with the system.
Stability of the differential equation is not an issue. Secondly, balancing has not been
related to zeros. Nonetheless, it would be interesting to compare these two concepts.
Unfortunately, a complete theory is not available to date. Some preliminary results may be
found in Lindner and Perkins (1984).

The report is organized as follows. Section 2 introduces the GHR and relates it to
near aggregation. Section 3 discusses the geometry of the GHR and near aggregation.
Section 4 introduces near unobservability and establishes the connection between near
aggregation and near unobservability. Section 5 shows how near unobservability is related to
almost pole-zero cancellation. Section 6 has the conclusions.


Consider the system

$$\dot{x} = Ax + Bu, \qquad y = Cx, \tag{2.1}$$


where $x \in R^n$, $u \in R^m$, $y \in R^p$, and (A,B,C) are appropriately dimensioned constant
matrices. After i steps of chained aggregation (Tse et al., 1978), (2.1) is transformed into
(2.3)


The representation (2.2) has the property that $N[H_1] = N[F_{j,j+1}] = 0$ for $j = 1, \dots, i-1$.

Furthermore, if $N[F_{i,i+1}] = 0$ or $F_{i,i+1} = 0$, then the representation (2.2) is the
Generalized Hessenberg Representation (GHR) of (2.1). If in (2.2) $F_{i,i+1} = 0$, then an obvious
reduced order model is given by

$$\dot{y}_a^i = F^i y_a^i + G^i u, \qquad y = H^i y_a^i. \tag{2.4}$$


If $i = 1$ in (2.4), then the reduction process is aggregation (Aoki, 1968). Thus, the GHR
extends the notion of aggregation (Tse et al., 1978). When $F_{i,i+1} = 0$, we say that (2.1)
exactly aggregates.


Another direction of extending the notion of aggregation is to consider systems in
which none of the superdiagonal blocks in (2.2) is zero but one of them is small. We
formalize this idea with:


Definition 2.1. Given $\mu_0 > 0$, we say that (2.2) is $\mu_0$-aggregable if there exists a block
$F_{j,j+1}$ such that $\|F_{j,j+1}\| < \mu_0$.

□ 

Remark 2.2. From the engineering point of view, we are interested in the case when $\mu_0$ is
small. We describe those systems which are $\mu_0$-aggregable when $\mu_0$ is small as nearly
aggregable. This terminology will be applied again in Def. 4.2 and Def. 5.3 below.

□ 

Remark 2.3. By considering the dual system, the GHR and Def. 2.1 can be applied to the
pair (A,B) in (2.1).

□ 

It is obvious that $\|F_{j,j+1}\|$ is basis dependent. Thus, for Def. 2.1 to be useful, we
must impose further structure on (2.1). In addition to the usual algebraic structure of
(2.1), we will attach the natural inner product to the state space which is defined with
respect to the given basis of (2.1). This inner product leads to a 2-norm on the underlying
vector space given by

$$\|x\| = (x^T x)^{1/2}. \tag{2.5}$$

In this paper we will assume all norms are the 2-norm in (2.5) or the corresponding
induced operator norm.

Once (2.1) is given with an inner product, we require that all operations on (2.1)
preserve this structure. This implies that all state transformations on (2.1) must be
orthogonal. In particular, the GHR (2.2) is constructed using orthogonal transformations
(which are also numerically stable (Van Dooren, 1981)).

Lemma 2.4. Suppose that the GHR (2.2) is constructed from (2.1) using orthogonal
transformations. Then $\|F_{j,j+1}\|$ is unique.

□ 

Proof: Consider (2.3) with $i = 1$. The basis associated with this representation is not
unique. All allowable transformations which preserve the structure of (2.3) (i.e.,
$N[H^1] = 0$) are given by

$$T = \begin{bmatrix} T_1 & 0 \\ 0 & T_2 \end{bmatrix}, \tag{2.6}$$

where $T_1$ is $p \times p$ and $T_1$ and $T_2$ are orthogonal. Since the 2-norm is invariant under
orthogonal transformations, it follows that

$$\|E^1\| = \|T_1^T E^1 T_2\|. \tag{2.7}$$

An induction argument completes the proof.


□ 
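The invariance used in the proof of Lemma 2.4 can be checked numerically. The following is a minimal sketch, assuming NumPy and a hypothetical random 5-state matrix partitioned with p = 2; the helper names `random_orthogonal` and `block_diag2` are illustrative and do not come from the report. It compares the 2-norm of the (1,2) block before and after a block-diagonal orthogonal change of basis of the form (2.6), and after a non-orthogonal rescaling.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_orthogonal(k):
    # The Q factor of a QR factorization of a random matrix is orthogonal.
    q, _ = np.linalg.qr(rng.standard_normal((k, k)))
    return q

def block_diag2(t1, t2):
    # Assemble T = diag(T1, T2), the structure of (2.6).
    p, q = t1.shape[0], t2.shape[0]
    t = np.zeros((p + q, p + q))
    t[:p, :p] = t1
    t[p:, p:] = t2
    return t

n, p = 5, 2
A = rng.standard_normal((n, n))          # hypothetical system matrix
E = A[:p, p:]                            # the (1,2) block

T = block_diag2(random_orthogonal(p), random_orthogonal(n - p))
A_rot = T.T @ A @ T                      # orthogonal change of basis
norm_before = np.linalg.norm(E, 2)
norm_after = np.linalg.norm(A_rot[:p, p:], 2)

D = np.diag([1.0, 1.0, 10.0, 10.0, 10.0])   # non-orthogonal rescaling of the last states
A_scaled = np.linalg.inv(D) @ A @ D
norm_scaled = np.linalg.norm(A_scaled[:p, p:], 2)

print(norm_before, norm_after, norm_scaled)
```

In this sketch the first two printed norms agree to rounding error, while the rescaled block is ten times larger, which illustrates why the definition of near aggregability needs a fixed inner product.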

Remark 2.5. The choice of the inner product for (2.1) is essentially a scaling issue. It
may be implied by the physical variables or it may be a design parameter in the selection
of a reduced order model.


The rest of this report is devoted to interpreting near aggregability in system
theoretic terms. We will start with a geometric interpretation of the GHR in the next
section.

3. GEOMETRY OF THE GHR


The  following  subspaces  play  a  fundamental  role  below. 


Definition 3.1 (Lindner et al., 1982): The vector $\xi \in X$ is an element of the j-th
unobservability subspace $L_j$ if $\xi = x(0)$ implies $y(0) = \dot{y}(0) = \cdots = y^{(j-1)}(0) = 0$. By
definition, $L_0 = X$.


□ 


The unobservability subspaces $L_j$ are intimately related to the GHR. Let $e_k$ be the kth
natural basis vector and $r_j$ the dimension of the $F_{jj}$ block in (2.2). Define

$$p_j = \sum_{k=1}^{j} r_k.$$

Theorem 3.2 (Lindner et al., 1982): Assume that (2.2) is a GHR. With respect to this
basis:

$$L_j = \mathrm{sp}\{ e_k : k = p_j + 1, \dots, n \}, \qquad 1 \le j \le i.$$


if  FlJ+I*0 


^i+l  ’  ^i.j+1 


□ 


The proof follows immediately from Def. 3.1 and (2.2).


Corollary 3.3. The system (2.1) is exactly aggregable iff $L_i$ coincides with an A-invariant
subspace for some i.

Proof: Obvious by comparing Theorem 3.2 and (2.2).

Corollary 3.3 gives a geometric interpretation of exact aggregation. If (2.2) is not
exactly aggregable, but is nearly aggregable, is there an A-invariant subspace V "near" $L_i$?
We will obtain an answer to this question below and so obtain a geometric interpretation
of near aggregability.

First, to interpret "near" we give a norm on the subspaces of X. Note that this norm
is derived from the inner product on X.

Definition 3.4 (Kato, 1966): Let U and V be subspaces of $R^n$. The gap between U and V
is the number

$$\tau(U,V) = \max\Big\{ \sup_{u \in U,\ \|u\| = 1}\ \inf_{v \in V} \|v - u\|,\ \ \sup_{v \in V,\ \|v\| = 1}\ \inf_{u \in U} \|v - u\| \Big\}.$$

□ 

Since we are using the two-norm, $\tau(U,V) = 1$ if U and V have different dimensions.


A useful geometrical interpretation of the gap enters through the use of canonical
angles.

Definition 3.5: Let $\mathcal{L}$ and $\mathcal{V}$ be subspaces of $R^n$ with orthonormal bases U and V,
respectively. Let $\sigma_i$ be the singular values of $U^T V$. Then the canonical angles between
$\mathcal{L}$ and $\mathcal{V}$ are the numbers

$$\theta_i = \cos^{-1} \sigma_i.$$

The gap function is related to canonical angles as follows.

Proposition 3.6 (Stewart, 1973): $\tau(\mathcal{L}, \mathcal{V}) = |\sin \theta_{\max}|$.

□ 

Hence,  if  all  the  canonical  angles  between  two  subspaces  are  small,  they  are  close  in  the 
gap  topology. 

Next, we show how to compute the gap. In what follows, we assume that the system
is given in the basis (2.3) and we are trying to find the gap between $L_i$ and an A-invariant
subspace V (although the method, developed by Stewart (1973), can be used on an
arbitrary pair of subspaces).

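The angles between two subspaces can be computed in the following way. Suppose
that the natural orthonormal basis of $R^n$ yields a basis for $L_i^c$ and $L_i$, respectively. In
matrix form

$$\begin{bmatrix} I_{p_i} & 0 \\ 0 & I_{n-p_i} \end{bmatrix} = [\, L_i^c \;\; L_i \,], \tag{3.1}$$

where the first $p_i$ columns span $L_i^c$ and the last $n - p_i$ columns span $L_i$. Let a second
$(n-p_i)$-dimensional subspace V and its complement be spanned by the orthogonal basis

$$[\, V^c \;\; V \,] = \begin{bmatrix} I & P \\ -P^T & I \end{bmatrix} \begin{bmatrix} (I + PP^T)^{-1/2} & 0 \\ 0 & (I + P^T P)^{-1/2} \end{bmatrix}. \tag{3.2}$$

Then

$$L_i^T V = (I + P^T P)^{-1/2}. \tag{3.3}$$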


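Let P have singular values $\sigma_j$. Then the canonical angles between $L_i$ and V are given by

$$\theta_j = \cos^{-1} (1 + \sigma_j^2)^{-1/2}. \tag{3.4}$$

It follows that

$$\sigma_j = \tan \theta_j, \tag{3.5}$$

and

$$\tau(L_i, V) = |\sin \theta_{\max}| \le \|P\| = |\tan \theta_{\max}|. \tag{3.6}$$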


Thus,  the  matrix  P  can  be  used  to  compute  the  gap  between  subspaces. 
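The relations (3.4) to (3.6) can be illustrated with a short numerical sketch. It assumes NumPy, a hypothetical 4-state example with a 2 x 2 matrix P, and a subspace V spanned by the columns of [P; I] as in (3.7); the function name `canonical_angles` is illustrative only. The angles are computed from the singular values of the product of orthonormal bases (Definition 3.5) and compared with the singular values of P.

```python
import numpy as np

def canonical_angles(U, V):
    # Canonical angles from the singular values of U^T V (Definition 3.5).
    s = np.linalg.svd(U.T @ V, compute_uv=False)
    return np.arccos(np.clip(s, -1.0, 1.0))

n, p = 4, 2
P = np.array([[0.3, 0.1], [-0.2, 0.05]])       # hypothetical P, shape p x (n - p)

L = np.vstack([np.zeros((p, n - p)), np.eye(n - p)])   # last n - p natural basis vectors
W = np.vstack([P, np.eye(n - p)])                       # columns span V = sp[P; I]
V, _ = np.linalg.qr(W)                                  # orthonormal basis of the same subspace

theta = canonical_angles(L, V)
sigma = np.linalg.svd(P, compute_uv=False)     # singular values of P
gap = np.sin(theta.max())                      # gap via Proposition 3.6

print(np.sort(np.tan(theta)), np.sort(sigma), gap, np.linalg.norm(P, 2))
```

Under these assumptions the tangents of the canonical angles coincide with the singular values of P, and the gap is bounded by the 2-norm of P, as (3.5) and (3.6) state.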


Let us now return to the problem of interpreting near aggregability in geometric
terms. Corollary 3.3 suggests that near aggregation might be characterized geometrically
as an invariant subspace near $L_i$. The GHR suggests that this situation occurs when
$\|F_{i,i+1}\|$ is small. To check for this geometry, we look for $(n-p_i)$-dimensional invariant
subspaces of the form

$$V = \mathrm{sp} \begin{bmatrix} P \\ I \end{bmatrix}. \tag{3.7}$$

If such a subspace exists, then (3.6) holds, from which $\tau(L_i, V)$ can be computed. The
following example shows that there exist systems which have this geometry.

Example 3.7. The linearized model of a biomethanization process (Ioannou and
Opdenacker, 1983) is given as


i 


m 


4 


yl 

•r  I 

4.1 


1—30.164  -78.631 
1  1  0 


y  =  [1  0  0  0] 


-2 1 .654  .0655 
0  -.07 


For  simplicity,  we  assume  that  the  inner  product  is  the  natural  one  with  respect  to 
the  basis  in  (3.8). 

The model (3.8) is a GHR. Note that $F_{34} = .0955$ is clearly small. Looking over all
eigenvectors, we find that the eigenvector corresponding to the pole¹ at
-0.75 is computed as

$$v^T = [\, P^T \;\; 1 \,] = [\, -.49\mathrm{E}{-2},\ .725\mathrm{E}{-3},\ -.787\mathrm{E}{-4},\ 1 \,]. \tag{3.9}$$

Therefore, the angle between $V = \mathrm{sp}(v)$ and $L_3$ is

$$\theta = \tan^{-1} \|P\| = 0.28^\circ, \tag{3.10}$$

so that

$$\tau(V, L_3) = \sin \theta = 0.005. \tag{3.11}$$
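The arithmetic behind (3.10) and (3.11) can be reproduced directly from the eigenvector entries in (3.9). This is a small sketch using only the Python standard library; the entries of P are read from the printed eigenvector, and the sign of the second entry is an assumption (it is unclear in the source but does not affect the norm).

```python
import math

# Entries of P taken from the eigenvector (3.9); the sign of the second
# entry is assumed and has no effect on the norm computed below.
P = [-0.49e-2, 0.725e-3, -0.787e-4]

norm_P = math.sqrt(sum(x * x for x in P))
theta = math.atan(norm_P)        # (3.10): theta = arctan ||P||
gap = math.sin(theta)            # (3.11): gap = sin(theta)

print(f"theta = {math.degrees(theta):.2f} deg, gap = {gap:.3f}")
```

With these entries the computed angle and gap round to the values 0.28 degrees and 0.005 quoted in the example.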


Below, we will show that under certain conditions a nearly aggregable system does
have an invariant subspace near $L_j$ for some j. But first, it is obvious from Theorem 3.2
that exact aggregation and near aggregation are intimately related to observability. In the
next section we will restate the geometry above in terms of observability and then show
how the observability property is related to near aggregability.


¹ See also Example 5.10 below.


4.  NEAR  AGGREGATION  AND  NEAR  UNOBSERVABILITY 


It is clear from Definition 3.1 that $L_1 = N[C]$ and $L_{j+1} \subset L_j$. Therefore, if $L_i$ coincides
with an A-invariant subspace, (2.1) is unobservable.


Lemma 4.1 (Aoki, 1968; Tse et al., 1978): The system (2.1) is exactly aggregable iff it is
unobservable.


The analysis of Section 3 suggested that nearly aggregable systems might have an
A-invariant subspace close to $L_i$. The observability interpretation above suggests the
following definition for this geometrical condition.


Definition 4.2 (Lindner and Perkins, 1984): Given $0 < \varepsilon_0 < 1$, we say the system (2.1) is
$\varepsilon_0$-unobservable if there exists an A-invariant subspace V such that

$$\tau(V, L_i) < \varepsilon_0$$

for some i.

(See Remark 2.2.)

We  might  guess  that  nearly  aggregable  systems  are  nearly  unobservable.  However, 
this  is  not  true  in  general  as  the  following  example  shows. 


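Example 4.3. Consider

$$\dot{x} = \begin{bmatrix} 0 & \mu \\ -\mu & 0 \end{bmatrix} x + \begin{bmatrix} 1 \\ 0 \end{bmatrix} u, \qquad y = [\, 1 \;\; 0 \,]\, x. \tag{4.1}$$

For $\mu > 0$, there does not exist a 1-dimensional invariant subspace, so (4.1) cannot be
nearly unobservable.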

□ 
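The claim of Example 4.3 can be checked numerically. The sketch below assumes NumPy, reads the state matrix of (4.1) as the 2 x 2 rotation-type matrix with off-diagonal entries mu and -mu, and picks a hypothetical small value of mu. The eigenvalues are purely imaginary, so no real eigenvector (and hence no one-dimensional real invariant subspace) exists, even though the superdiagonal block is small.

```python
import numpy as np

mu = 1e-3                                   # hypothetical small coupling
A = np.array([[0.0, mu], [-mu, 0.0]])       # assumed reading of the state matrix in (4.1)

eigvals = np.linalg.eigvals(A)
off_diag_norm = abs(A[0, 1])                # norm of the superdiagonal block, equal to mu

# A real 1-dimensional invariant subspace requires a real eigenvalue.
has_real_eigenvalue = bool(np.any(np.abs(eigvals.imag) < 1e-12))

print(eigvals, off_diag_norm, has_real_eigenvalue)
```

So the system is mu_0-aggregable for any mu_0 larger than mu, yet the search for a nearby invariant subspace fails for dimensional reasons, which is what motivates the existence conditions developed next.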

Roughly speaking, nearly aggregable systems are nearly unobservable if there exists
an appropriately dimensioned invariant subspace. To develop this idea, we first give
conditions under which these invariant subspaces of a given dimension exist. The following
result by Stewart (1973) is useful.

Suppose that P in (3.2) is used to define a change of basis in $R^n$. (Note that P can be
considered a kind of generalized rotation.) Since V is A-invariant, the (1,2) block in the
new representation should be zero. Using the representation of A in (2.3), we have

$$V_c^T A V = (I + PP^T)^{-1/2} (F^i P - P A^i + E^i - P K^i P) (I + P^T P)^{-1/2} = 0. \tag{4.2}$$

Each solution P of the Riccati equation in (4.2) corresponds to a particular
$(n-p_i)$-dimensional invariant subspace V of the form (3.7). Thus, (4.2) will have a solution
P if and only if subspaces of the form (3.7) exist. If P satisfies (4.2) we can rewrite that
equation as

$$F^i P - P A^i = P K^i P - E^i. \tag{4.3}$$


We can bound the roots of (4.3) as follows. First note that

$$T(P) = F^i P - P A^i \tag{4.4}$$

is a linear operator in P. Hence, if $K^i$ is small enough the quadratic term in (4.3) can be
neglected and we obtain the linear equation

$$T(P) = -E^i. \tag{4.5}$$

This equation has a solution if T is invertible. It can be shown (Stewart, 1973) that if
(4.5) has a solution then (4.3) has a solution (under appropriate conditions). Thus,
existence of solutions of (4.3) is related to the invertibility of T. So motivated we
define

Definition 4.4 (Stewart, 1973): The separation of $F^i$ and $A^i$, denoted by $\delta$, is defined as

$$\delta = \begin{cases} \|T^{-1}\|^{-1}, & 0 \notin \lambda(T), \\ 0, & 0 \in \lambda(T), \end{cases}$$

where $\lambda(T)$ denotes the eigenvalues of T.

□ 

Remark 4.5. It is well known that T has only nonzero eigenvalues (i.e., T is invertible) iff
$F^i$ and $A^i$ have no common eigenvalues.

□

Remark 4.6. Varah (1979) discusses the separation function.

□ 

Also note that from the approximate linear equation (4.5) we get

$$\|P\| \le \|T^{-1}\| \cdot \|E^i\|. \tag{4.6}$$

The  ideas  sketched  above  are  stated  precisely  in  the  following  theorem. 

Theorem 4.7 (Stewart, 1973): Let $\delta = \|T^{-1}\|^{-1}$, $\gamma = \|E^i\|$, $\eta = \|K^i\|$. Then if

$$\frac{\gamma \eta}{\delta^2} < \frac{1}{4}, \tag{T.1}$$

there exists a matrix P which satisfies (4.3) such that

$$\|P\| < \frac{2\gamma}{\delta}. \tag{T.2}$$


Remark 4.8. Statement (T.2) of Theorem 4.7 provides an estimate of the distance from
$L_i$ to V which is computed in terms of the known data in (2.3).
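The role of the separation can be made concrete with a numerical sketch. It assumes NumPy and a hypothetical 3-state system partitioned as [[F, E], [K, A]] in the manner of (2.3); the block values, the helper name `solve_T`, and the simple fixed-point iteration are illustrative choices, not the report's algorithm. The linear operator T(P) = F P - P A is represented through Kronecker products, the quadratic equation F P - P A = P K P - E is solved by iterating on the linearized equation, and the separation is taken in the Frobenius norm (Stewart's setting), which may differ from the 2-norm convention used in the report.

```python
import numpy as np

def solve_T(F, A, R):
    """Solve F P - P A = R for P and return (P, matrix of the operator T)."""
    p, q = F.shape[0], A.shape[0]
    # Column-major vec: vec(F P) = (I kron F) vec(P), vec(P A) = (A^T kron I) vec(P).
    M = np.kron(np.eye(q), F) - np.kron(A.T, np.eye(p))
    vec_p = np.linalg.solve(M, R.reshape(-1, order="F"))
    return vec_p.reshape((p, q), order="F"), M

# Hypothetical blocks of the system matrix [[F, E], [K, A]].
F = np.array([[-5.0, 1.0], [0.0, -6.0]])
A = np.array([[-0.1]])
E = np.array([[0.01], [0.02]])
K = np.array([[1.0, 2.0]])

P, M = solve_T(F, A, -E)                  # linearized equation T(P) = -E
for _ in range(50):                       # fixed-point iteration on the quadratic equation
    P, _ = solve_T(F, A, P @ K @ P - E)

residual = F @ P - P @ A + E - P @ K @ P

delta = np.linalg.svd(M, compute_uv=False).min()   # Frobenius-norm separation of F and A
gamma = np.linalg.norm(E)                 # Frobenius norms of the coupling blocks
eta = np.linalg.norm(K)
condition = gamma * eta / delta**2        # quantity required to be below 1/4
bound = 2.0 * gamma / delta               # bound on the norm of P

S = np.block([[F, E], [K, A]])
W = np.vstack([P, np.eye(A.shape[0])])    # basis of the form [P; I]
invariance_error = np.linalg.norm(S @ W - W @ (K @ P + A))

print(condition, np.linalg.norm(P), bound, invariance_error)
```

For this data the condition value is far below 1/4, the computed P satisfies the norm bound, and the columns of [P; I] span an invariant subspace of the full matrix to rounding error. A smaller separation (eigenvalues of F and A closer together) would shrink delta and weaken both the existence condition and the bound.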


Remark 4.9. As $\|E^i\| \to 0$, $V \to L_i$ (in the gap topology), as suggested by the GHR.
However, we see that the "smallness" of $\|E^i\|$ is measured against the separation of $F^i$ and
$A^i$.


Definition 4.10. The decomposition (2.3) is called separable if the separation of $F^i$ and
$A^i$ is nonzero.


In the following we will relate separable decompositions to near unobservability and
near aggregability. The idea is to study the system structure as $\|F_{i,i+1}\| \to 0$ in one case,
and as $V \to L_i$ in the other. To accomplish this analysis we will parameterize the system
matrix. Note that these parameterizations only change the operator on X but leave the
underlying structure of X (specifically the inner product) invariant.

Consider (2.3) and parameterize the system in $\mu$ by replacing $F_{i,i+1}$ with $\mu F_{i,i+1}$.


Corollary 4.11. Suppose that (2.3) is separable. Then there exists a $\bar{\mu}$ such that for all
$0 < \mu \le \bar{\mu}$ there exists an $(n-p_i)$-dimensional A-invariant subspace.


Proof: Immediate from Theorem 4.7.


□ 


The connection to near unobservability is now obvious. Again parameterize (2.3) on
$\mu$ as above. Let $\varepsilon_0$ be given.

Proposition 4.12. Suppose that (2.3) is separable. Then there exists a $\mu_0$ such that for all
$0 < \mu \le \mu_0$, the system is $\varepsilon_0$-unobservable.

□

Proof: By Corollary 4.11 there exists $\bar{\mu}$ which guarantees the existence of an
$(n-p_i)$-dimensional invariant subspace. Select $\mu_0 < \bar{\mu}$ such that

$$\mu_0 < \frac{\varepsilon_0\, \delta}{2 \|F_{i,i+1}\|}. \tag{4.7}$$

Theorem 4.7 together with (4.7) implies that $\|P\| < \varepsilon_0$. Now the proposition follows
from (3.6).

□

We can establish a converse to Proposition 4.12 as follows. In (2.3), suppose there
exists an invariant subspace V which has a basis of the form (3.7). From (2.3) define a
parameterized set of systems as follows:

$$\dot{x} = F(\varepsilon) x + G u, \qquad y = H x, \tag{4.8}$$

where G and H are identical to (2.3) and

$$F(\varepsilon) = \begin{bmatrix} I & -\varepsilon P \\ 0 & I \end{bmatrix} \begin{bmatrix} A_1 & 0 \\ A_{12} & A_2 \end{bmatrix} \begin{bmatrix} I & \varepsilon P \\ 0 & I \end{bmatrix}. \tag{4.9}$$

Note that at $\varepsilon = 1$, (4.8) and (2.3) are identical. Also note that $\tau(L_i, V) \to 0$ as $\varepsilon \to 0$.


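Let $\mu_0$ be given.


Proposition 4.13. Consider the system defined by (4.8). There exists an $\varepsilon_0 > 0$ such that
for all $0 < \varepsilon < \varepsilon_0$, (4.8) is $\mu_0$-aggregable.


Proof: From (4.9) we have

$$E^i(\varepsilon) = \varepsilon A_1 P - \varepsilon P A_2 - \varepsilon^2 P A_{12} P, \tag{4.10}$$

so that if

$$\|E^i(\varepsilon)\| \le \|A_1 P - P A_2\|\, \varepsilon + \|P A_{12} P\|\, \varepsilon^2 = a\varepsilon + b\varepsilon^2 < \mu_0, \tag{4.11}$$

then (4.8) is $\mu_0$-aggregable. The last inequality in (4.11) is satisfied for all $0 < \varepsilon < \varepsilon_0$ if
$\varepsilon_0$ is chosen less than the positive root of

$$b\varepsilon^2 + a\varepsilon - \mu_0 = 0. \tag{4.12}$$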


5. POLE-ZERO CANCELLATION


In the previous sections we have related (near) aggregation to (near un-)observability.
We can extend these concepts to transfer function matrices by generalizing the
following well known result: if the system is unobservable then the transfer function
matrix exhibits a pole-zero cancellation. Thus, exact aggregation corresponds to pole-zero
cancellation. Below, we will relate near aggregation and near unobservability to almost
pole-zero cancellation.

Consider (2.3). In this section we assume that

$$G^i = \begin{bmatrix} 0 \\ 0 \\ G_1 \end{bmatrix} \tag{5.1}$$

and that $G_1$ is nonsingular. This assumption is equivalent to assuming the number of
inputs equals the number of outputs and the first nonzero Markov parameter is
nonsingular.


Define  the  following  state  transformation 


'■\ 

1  () 

i 

>'a 

Z, 

X  I 

l 

X  =  (<)  — B.^jG,  1  ]  . 

Substituting (5.2) into (2.3) we obtain


y  =  [H1  o] 


Remark 5.1. It is shown in Lindner (1982) that (5.3) has the following properties:

1) $V^* = L_i$, where $V^*$ is the largest (A,B)-invariant subspace in N[C] (Wonham, 1979),

2) R = 0, where R is the largest controllability subspace in N[C] (Wonham, 1979), and

3) the invariant zeros of (5.3) are the eigenvalues of $S_4$.

It is interesting that the invariant zeros should be explicitly displayed in (5.3). We will
use this structure below.


Remark 5.2. The natural inner product implied by the basis in (5.3) is not the same as
the natural inner product implied by the basis in (2.1) (see (5.2)). In this section we
assume that the underlying system leads to no natural inner product for (2.1). In this
case the choice of inner product is arbitrary and we choose the natural inner product
associated with the basis in (5.3). See also Remark 5.9 below.


To see where we are going, suppose that the system in (5.3) has an
$(n-p_i)$-dimensional invariant subspace V of the form

$$V = \mathrm{sp} \begin{bmatrix} P \\ I \end{bmatrix}. \tag{5.4}$$


Furthermore, suppose that (5.3) is $\varepsilon_0$-unobservable so that

$$\|P\| \le \frac{\varepsilon_0}{\sqrt{1 - \varepsilon_0^2}} \tag{5.5}$$

(see (3.6)). Using (3.2) to define a change of basis, it can be shown (Stewart, 1973)
that

$$\lambda(S) = \lambda(S_1 - P S_3) \cup \lambda(S_4 + S_3 P). \tag{5.6}$$

Recall that $p_i \in \lambda(S_4 + S_3 P)$ are the poles of (5.3) (see (5.6)) and $z_i \in \lambda(S_4)$ are the
invariant zeros of (5.3) (Remark 5.1). Therefore, if $S_3 P$ is a "small" perturbation of the
eigenvalues of $S_4$, then (5.3) should exhibit an almost pole-zero cancellation.

Note that how close the poles and zeros are depends on the block $S_3$ in (5.3). This
block, which is associated with the controllability structure of (5.3), has not played a
direct role in our results so far. Intuitively, its appearance in our analysis says that the
controllability structure as well as the observability structure plays a role in determining
almost pole-zero cancellations. (This is in contrast to cancellations in which only one
concept is involved per cancellation.) We can include both concepts in our analysis here by
extending Definition 2.1. Let $\mu_0 > 0$ be given.


Definition 5.3. The system (5.3) is $\mu_0$-input/output aggregable if $\|S_2\| < \mu_0$ and
$\|S_3\| < \mu_0$.

□ 

Remark 5.4. It is easily seen by duality that all of the previous results on observability
have corresponding results on controllability. By the dual of the results of Section 4, near
input aggregability can be related to near uncontrollability.


We can characterize the deviation of the eigenvalues of $S_4$, say, from the eigenvalues
of S by applying an eigenvalue perturbation theorem.


Theorem 5.5 (Bauer and Fike, 1960): Let $X^{-1} S_4 X = \mathrm{diag}(\lambda_i)$ and define
$\kappa(X) = \|X\| \cdot \|X^{-1}\|$. Let $\mu$ be an eigenvalue of $S_4 + S_3 P$. Then

$$\min_i |\mu - \lambda_i| \le \kappa(X)\, \|S_3 P\|.$$
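The Bauer-Fike bound can be tried out on a small example. The sketch below assumes NumPy; the diagonalizable matrix `S4` and the perturbation `pert` (standing in for the product of S_3 and P) are hypothetical values chosen only for illustration.

```python
import numpy as np

S4 = np.array([[-1.0, 1.0], [0.0, -2.0]])            # hypothetical, diagonalizable
pert = 0.01 * np.array([[1.0, 2.0], [3.0, 4.0]])     # stands in for S3 @ P

lam, X = np.linalg.eig(S4)                            # eigenvalues and eigenvector matrix
kappa = np.linalg.norm(X, 2) * np.linalg.norm(np.linalg.inv(X), 2)
rhs = kappa * np.linalg.norm(pert, 2)                 # right-hand side of the bound

mu = np.linalg.eigvals(S4 + pert)                     # perturbed eigenvalues
distances = [min(abs(m - l) for l in lam) for m in mu]

print(distances, rhs)
```

Each perturbed eigenvalue lies within the printed bound of some eigenvalue of `S4`. In the setting of the text, the eigenvalues of `S4` play the role of the invariant zeros and the perturbed eigenvalues the role of poles, so a small perturbation and a well-conditioned eigenvector matrix force poles to sit near zeros.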


Theorem 5.6. Assume that (5.3) is $\mu_0$-input/output aggregable and $\varepsilon_0$-unobservable.
Also, assume $S_4$ in (5.3) has simple structure. Then

$$\min_i |p - z_i| \le \kappa(X)\, \frac{\mu_0\, \varepsilon_0}{\sqrt{1 - \varepsilon_0^2}}.$$


Proof: Combine Theorem 5.5 and (5.5).


Remark 5.7. Roughly speaking, Theorem 5.6 says that a system which is nearly
input/output aggregable and nearly unobservable exhibits an almost pole-zero cancellation.


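Example 5.8. Consider the transfer function

$$G(s) = \frac{s}{(s + \varepsilon)(s + 1)}. \tag{5.7}$$

Realizing (5.7) in controllable canonical form we obtain

$$\begin{bmatrix} \dot{y} \\ \dot{z}_1 \end{bmatrix} = \begin{bmatrix} -(1 + \varepsilon) & -\varepsilon \\ 1 & 0 \end{bmatrix} \begin{bmatrix} y \\ z_1 \end{bmatrix} + \begin{bmatrix} 1 \\ 0 \end{bmatrix} u. \tag{5.8}$$

Then we can check the near unobservability of (5.8) by computing a basis of the form
(5.4) for the eigenvector associated with the eigenvalue $-\varepsilon$. In fact, P is given by

$$P = \tan \theta = -\varepsilon. \tag{5.9}$$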

This equation clearly shows the relationship between near aggregation and near
unobservability. These two concepts are related to pole-zero cancellation by applying Theorem
5.6:

$$|\varepsilon - 0| \le |\varepsilon| \cdot 1. \tag{5.10}$$

Equation (5.10) is a state space version of the pole-zero cancellation evident in (5.7).


Note  that  these  results  do  not  depend  on  eigenvalue  separation. 


□ 
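The computation in Example 5.8 can be mirrored numerically. The sketch below assumes NumPy, a hypothetical value of epsilon, and the reading of (5.8) as the controllable canonical realization of s / ((s + eps)(s + 1)) with state matrix [[-(1 + eps), -eps], [1, 0]]; that reading is an assumption about the garbled source. It extracts the eigenvector for the eigenvalue at -eps, normalizes it to the form [P; 1], and compares the pole with the zero at the origin.

```python
import numpy as np

eps = 0.05                                           # hypothetical value of epsilon
A = np.array([[-(1.0 + eps), -eps], [1.0, 0.0]])     # assumed state matrix of (5.8)

lam, vecs = np.linalg.eig(A)
k = int(np.argmin(np.abs(lam + eps)))                # index of the eigenvalue at -eps
v = np.real(vecs[:, k])
P = v[0] / v[1]                                      # normalize the eigenvector to [P; 1]

theta = np.arctan(abs(P))                            # angle to the subspace spanned by e_2
pole, zero = float(np.real(lam[k])), 0.0             # zero of the assumed transfer function

print(P, np.degrees(theta), abs(pole - zero))
```

Under these assumptions P equals -eps, so the invariant subspace tilts away from the unobservability subspace by an angle of roughly eps radians, and the pole and zero are separated by exactly eps.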


Remark 5.9. Note that (5.8) is $\mu_0$-aggregable for $\mu_0 > \varepsilon$ but $\mu_0$-input/output aggregable
only for $1 < \mu_0$. Suppose we scale $z_1$ by

$$\bar{z}_1 = \sqrt{\varepsilon}\, z_1. \tag{5.11}$$

After substituting (5.11) into (5.8), the new representation is $\mu_0$-input/output aggregable
for any $\mu_0 > \sqrt{\varepsilon}$ but not $\mu_0$-input/output aggregable for any $\mu_0 < \sqrt{\varepsilon}$. On the other
hand, a scaling similar to (5.11) could make (5.8) $\mu_0$-aggregable for any $\mu_0$. Apparently,
there is a natural $\mu_0$ for input/output aggregation associated with (5.8). This idea is
completely worked out for single-input single-output systems by Lindner (1984). It is shown
that the geometric structure leads to a natural basis in the state space. The metric
associated with that basis is then a natural setting for near unobservability.

□ 

Example 5.10. Consider again the system in Example 3.7. This system is nearly
unobservable. The transfer function of this model is

$$\frac{Y(s)}{U(s)} = P(s) = \frac{(s + .07)}{(s + .0740)(s + 4.04084)(s + 17.1785)(s + .35055)}, \tag{5.12}$$

which shows an almost pole-zero cancellation.

□ 

Example 5.11. This example shows that when the assumption at the beginning of the
section is relaxed, the results fail. Consider the transfer function matrix (Rosenbrock,
1970)

$$P(s) = \begin{bmatrix} \dfrac{1}{(s+1)^2} & \dfrac{1}{(s+1)(s+2)} \\[2ex] \dfrac{1}{(s+1)(s+2)} & \dfrac{s+3}{(s+2)^2} \end{bmatrix}. \tag{5.13}$$

This matrix has Smith-McMillan form

$$M(s) = \begin{bmatrix} \dfrac{1}{(s+1)^2 (s+2)^2} & 0 \\[2ex] 0 & s+2 \end{bmatrix}, \tag{5.14}$$

which shows a pole and zero at -2 but does not exhibit a pole-zero cancellation. A state
space representation of this system is given by


'-2  4  '  1  0 

o 

o 

y 

-1  2  '  1  0 

1 

y 

0  1 

= 

+ 

-  -  - 

Z 

3  -7  '  -4  1 

z 

1  -3 

i 

1 

u 

1 

IO 

o 

o, 

1  0  '  0  0 

1 

y 

o 

o 

o 

z 

(5.15) 


Note that $G_1$ is singular in (5.15). Also note that $V^* = L_2$.


It can be checked that the Jordan form of A in (5.15) has a block of order 2
corresponding to each eigenvalue $\lambda = -1$ and $\lambda = -2$. Therefore, there are two
one-dimensional invariant subspaces which are possibly near $L_2$. The eigenvector
corresponding to $\lambda = -2$ is

$$v_{-2}^T = [\, .889,\ -.111,\ -.444,\ 1 \,] = [\, P^T \;\; 1 \,]. \tag{5.16}$$

The angle between $V_{-2}$ and $L_2$ is

$$\theta = \tan^{-1} \|P\| = 45^\circ, \tag{5.17}$$

so that

$$\tau(V_{-2}, L_2) = \sin \theta = 0.707. \tag{5.18}$$

The eigenvector corresponding to $\lambda = -1$ is

$$v_{-1}^T = [\, -.250,\ -.50,\ -1.75,\ 1 \,]. \tag{5.19}$$

We have

$$\tau(V_{-1}, L_2) = \sin 61.4^\circ = 0.878. \tag{5.20}$$

Since $\tau(V, L_2) = 1$ for any larger dimensional invariant subspace, we conclude that the
system is not nearly unobservable. Therefore, we would not predict any almost pole-zero
cancellation. Also note that (5.15) is not nearly aggregable.


6.  CONCLUSIONS 


We have generalized the notion of aggregation to near aggregation by using the GHR.
This generalization required the introduction of a metric into the state space model, a
novel feature of our approach. We then showed that if there exists an appropriately
dimensioned invariant subspace, then near aggregation is equivalent to near
unobservability from a geometric viewpoint.

Under certain restrictions, it is possible to interpret the previous results as pole-zero
cancellations in the transfer functions. In order to interpret near unobservability in
terms of pole-zero cancellations, we had to assume the first nonzero Markov parameter
was nonsingular. This assumption can be relaxed for single-input single-output systems
(Lindner, 1984). The multivariable case is under investigation.